ALGEBRAIC DEGREE OF POLYNOMIAL OPTIMIZATION 
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'66 , 

»' ' Abstract. Consider the polynomial optimization problem whose objective and constraints 

are all described by multivariate polynomials. Under some genericity assumptions, we prove 
that the optimality conditions always hold on optimizers, and the coordinates of optimizers 
^ ' are algebraic functions of the coefficients of the input polynomials. We also give a general 

D I formula for the algebraic degree of the optimal coordinates. The derivation of the algebraic 

degree is equivalent to counting the number of all complex critical points. As special cases, we 
' obtain the algebraic degrees of quadratically constrained quadratic programming (QCQP), 

second order cone programming (SOCP) and p-th order cone programming (pOCP), in 
analogy to the algebraic degree of semidefinite programming [H]. 
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1. Introduction 



Consider optimization problem 

min fo{x) 

(1.1) 



s.t. fi{x) = 0, i = 1, • • • ,me 

fi{x) > 0, i = me + 1, • • • ,m 



> 

i where fi{x) are multivariate polynomial functions in ]R[x] (the ring of polynomials in x = 

(xi, • • • ,Xn) with real coefficients). The recent interest on solving polynomial optimization 
. problems [3 [TUl [TT] by using semidefinite relaxations or other algebraic methods motivates 

\ this study of the algebraic properties of the polynomial optimization problem (jl.l|) . A fun- 

damental problem about (jl.ip is how the optimal solutions depend on the input polynomials 
fi{x). When the optimality condition holds and it has finitely many complex solutions, the 
optimal solutions are algebraic functions of the coefficients of polynomials fi{x), i.e., the co- 
^ ■ ordinates of optimal solutions are roots of some univariate polynomials whose coefficients are 

functions of the input data. An interesting and important problem in optimization theory is 
to study the properties of these algebraic functions, e.g., how big their degrees are, i.e., what 
is the number of complex solutions to the critical equations of (11. ih . Let us begin discussions 
with some special cases. 

The simplest case of (II. ip is the linear programming (LP), i.e., all polynomials fi{x) have 
degree one. In this case, the problem (II. ip has the form (after removing the linear equality 
constraints) 



(1.2) 




T 
C X 



Ax> b 



where c. A, b are matrices or vectors of appropriate dimensions. The feasible set of (jl.2p is 
now a polytope described by some linear inequalities. As is well-known, one optimal solution 
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X* (if it exists) of ()1.2p must occur at one vertex of the polytope. So x* can be determined by 
the hnear system consisting of the active constraints. When the objective c^x is changing, 
the optimal solution might move from one vertex to another vertex. So the optimal solution is 
a piecewise linear fractional function of the input data (c, A, b). When c, A, b are all rational, 
an optimal solution must also be rational, and hence its algebraic degree is one. 

A more general convex optimization which is a proper generalization of linear programming 
is semidefinite programming (SDP) which has the standard form 

T 
C X 

n 

Aq + Y^x^a, y 

i=l 

where c is a constant vector and the Ai are constant symmetric matrices. The inequality 
X y means the matrix X is positive semidefinite. Recently, Nie et al. [8] studied the 
algebraic properties of semidefinite programming. When c and Ai are generic, the optimal 
solution X* of (jl.3p is shown [8] to be a piecewise algebraic function of c and Ai. Of course, 
the constraint of (jl.3p can be replaced by the nonnegativity of all the principle minors of the 
constraint matrix, and hence (jl.Sh becomes a special case of (II. Ih . However, the problem 
p.3p has very special nice properties, e.g., it is a convex program and the constraint matrix 
is linear with respect to x. Interestingly, if c and Ai are generic, the degree of each piece 
of this algebraic function only depends on the rank of the constraint matrix at the optimal 
solution. A formula for this degree is given in [8]. 

Another optimization problem frequently used in statistics and biology is the Maximum 
Likelihood Estimation (MLE), which has the standard form 

(1.4) max pi(x)'^^p2(xV^ ■ ■■pn(xV" 

where Q is an open subset of M", the pi{x) are polynomials such that 'YiPi{x) = 1, and the 
Ui are given positive integers. The optimizer x* is an algebraic function of {ui, . . . , Un). This 
problem has recently been studied and a formula for the degree of this algebraic function has 
been found (cf. [B [5] ) ■ 

In this paper we consider the general optimization problem (jl.ip . when the polynomials 
/o,/i,...,/m define a complete intersection, i.e., their common set of zeros has codimension 
m + 1. We show that an optimal solution is an algebraic function of the input data. We 
call the degree of this algebraic function the algebraic degree of the polynomial optimization 
problem (jl.ip . Equivalently, the algebraic degree equals the number of complex solutions to 
the critical equations of (jl.ip . when this is finite. Under some genericity assumptions, we 
give in this paper a formula for the algebraic degree of (jl.ip 

Throughout this paper, the words "generic" and "genericity" are frequently used. These 
words are given a precise meaning in algebraic geometry. Some property or condition holds 
"generically' means it holds in some Zariski open set (a set described by polynomial inequal- 
ities 7^). Any statement that is proved under such a genericity hypothesis will be valid for 
all data that lie in a dense, open subset of the space of data, and hence it will hold except 
on a set of Lebesgue measure zero. 

The algebraic degree of polynomial optimization (jl.ip addresses the computational com- 
plexity at a fundamental level. To solve (jl.ip exactly essentially reduces to solving some 
univariate polynomial equations whose degrees are the algebraic degree of (jl.ip . As we can 
see later, the algebraic degree might be very big. 



(1.3) 
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s.t. 



ALGEBRAIC DEGREE OF POLYNOMIAL OPTIMIZATION 



3 



The paper is organized as follows. Section 2 derives a general formula for the algebraic 
degree, and Section 3 gives the formulae of the algebraic degrees for special cases like quadrat- 
ically constrained quadratic programming, second order cone programming, and p-th order 
cone programming. 

2. A GENERAL FORMULA FOR ALGEBRAIC DEGREE 

In this section, we shall derive the formula for the algebraic degree of polynomial opti- 
mization problem (jl.ip . when the polynomials define a complete intersection. Suppose the 
polynomial fi{x) has degree di. Let x* be one local or global optimal solution of (jl.ip . 

At first, we assume all the inequality constraints are active, i.e., nif, = m, and the co- 
efficients of polynomials /i, /2, • ' ' i fm are generic. When m = n, by Bertini's Theorem [H 
§17.16], the feasible set of (II. ID is finite and hence the algebraic degree is equal to the Bezout's 
number did2 • • • dm- So, without loss of generality, assume m < n. If the variety 

y = {xGC": f^{x) = --- = fmix)=0} 

is smooth at x*, i.e., the gradient vectors 

V/l(x*),V/2(x*),--- ,Vf.m{x*) 

are linearly independent, then the Karush-Kuhn- Tucker (KKT) condition holds at x* (Chap- 
ter 12 in [H]), i.e., 

' m 

(2.1) J V/o(x*) + EA*V/,(x*) = 

1 fiix*) = ~-- = fmix*) = 

where A^, ■ • • , AJ^ are Lagrange multipliers for constraints fi{x) = 0, • • • , fm{x) = 0. Thus 
the optimal solution x* and Lagrange multipliers A* = (A^, • • • , X^) are determined by the 
polynomial system (12. ip . The set of points x* in solutions to (12. ip forms the locus of critical 
points of (11. ip . If the system (12. ip is zero-dimensional, then, by elimination theory [2], the 
coordinates of the points x* are algebraic functions of the coefficients of the polynomials fi. 
Each coordinate x* can be determined by some univariate polynomial equation like 

(x*)^^ + ai(x*)^'-i + • • • + as^^ix* + 05^ = 

where aj are rational functions of the coefficients of the fi. Interestingly, when /i, /2, /m 
are generic, the KKT condition always holds at any optimal solutions, and the degrees 5i 
are equal to each other. This common degree counts the number of solutions to (12. ip . i.e., 
the cardinality of the critical locus of (jl.ip or, by definition, the algebraic degree of the 
polynomial optimization (jl.ip . We will derive a general formula for this degree. 

In what follows, we work on the complex projective spaces, where the above question may 
be answered as a problem in intersection theory. For this we need to translate the optimization 
problem to a relevant intersection problem. Let P" be the n-dimensional complex projective 
space. A point 5; € P" is a class of vectors (xq, xi, - ■ ■ , that are parallel to each other. A 
variety in P" is a set of points x that satisfy a collection of homogeneous polynomial equations 
in (xo,xi, • • • ,Xn)- Let fi{x) = Xq' /{{x/xq) be the homogenization of fi{x). Define U to be 
the projective variety in P" as 

U = {xe¥^: f\{x) = hix) = ■■■= ~fm{x) = 0}. 
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V/i(x) 



Next, we let 

d I _ _ d i 

dxQ J * dxn * 

be the gradient vector, with respect to the homogeneous coordinates. Notice that {-^fi = 
Xq'~^ g|-/j(x/xo)), so the homogenization of V/j coincides with the last n coordinates in V/j. 
In this homogeneous setting, the optimality condition for problem (jl.ip with m = me is 



(2.2) 



[x, /i) G X 



/o(i)-rf = /i(x) 



rank[V(/o(5;) + /xx^°),V(/i(x)),...,V(/;„(x))] <m 



where /i € M is the critical value. Let x* G {xq 7^ 0} be a critical point, i.e., a solution to 
()2.2p . We may eliminate /i by asking that the matrix 



fo{i*) fi{x*) 



X, 



do 











have rank one, and the matrix 



do 



-fm{x*) 



a 

dXn 



Mx*) ^h{x*) 



a 

axn 



Efm{x*) 







have rank m + 1. The first condition and the condition xq ^ mean that our critical points 

r GZ^ = {A(x) = --- = /m(5) = o} 

while the rank of the second matrix equals m + 1 at points where xq 7^ only if the submatrix 



M 



I-Joix) £-Mx) ... ^J^{x)- 



.i-Mx) at/i(x) ••• ^Mx) 



has rank m. Therefore we define W to be the projective variety in P": 

W = {i G P" : ah the (m + 1) x (m + 1) minors of M vanish } , 

the locus of points where the rank of [V(/o), V(/m)] is less than or equal to m. Denote 
the class of (1, xi, • • • , x„) in by x. 

Proposition 2.1. Assume m = rUe- If the polynomials /i, • • • , fm o,i"g generic, then we have: 

(i) The affine variety 1/ = {x G C" : /i(x) = • • • = /^(x) = 0} is smooth. 

(ii) The KKT condition holds at any optimal solution x* ; 

(iii) // /o is also generic, the affine variety 

(2.3) if = |x G y : 3Ai, - •• ,Am such that \/ fo{x) + Y^XiV fi{x) = o| 

defined by KKT system i2. 1]) is finite. 
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Proof, (i) When polynomials /i,-- - , fm are generic, by Bertini's Theorem [H §17.16], the 
variety U has codimension m and is smooth, in particular the affine subvariety V = Uri{xo ^ 
0} is smooth. In terms of the Jacobian matrix 



_d 

dx 



S^frn{x) 
d^fm{x) 



its rank is full at x. Furthermore, the tangent space of y at x is, of course, not contained in 
the hyperplane = at infinity, so the column [l • • • O]"'" is not in the column space 
of the matrix at x. Therefore already the submatrix 



^Jlix) ^J2{X) 



has full rank at x, i.e., the gradients 

V/l(x) ••• Vfmix) 

are linearly independent at x G y. 

(ii) When x* is one optimizer, which must belong to V, by (i), we know the gradients 

V/i(x*),V/2(x*),--- ,V/„(x*) 

are linearly independent. Hence the KKT condition holds at x* (Chapter 12 in [9]). 

(iii) We claim that the intersection U nW defined above is finite. Since our critical points 
y n W is a subset of U Ci W, (iii) would follow. The codimension of U is m, and this variety 
is smooth, so the matrix M has by (i) rank at least m at each point of U. The variety 
U n {fo{x) = 0}, is, by Bertini's Theorem, also smooth, so as above, the matrix M has full 
rank at points in the affine part V fl {foix) = 0}. On the other hand, M is the Jacobi 
matrix for the variety U fl {fo{x) = 0}. This variety is again smooth and has codimension 
m + 1 in the hyperplane {xq = 0}, so M must have full rank m + 1 on W n {fo{x) = 0}. 
The variety W where M has rank at most m, therefore cannot intersect U fl {/o(i) = 0}. 
But Bezout's Theorem [3l §8.4] says that if the sum of the codimensions of two varieties 
in P" does not exceed n, then they intersect. In particular, any curve in U intersects the 
hypersurface {fo{x) = 0}. Since U fl {fo{x) = 0} has codimension m + 1, we deduce that 
W must have codimension at least n — m. Furthermore, since any curve in U CiW would 
intersect {fo{x) = 0}, the intersection U nW must be empty or finite. On the other hand, 
the variety of n x (m + l)-matrices with homogeneous forms as entries having rank no more 
than m has codimension at most n — m. So the codimension of W equals n — m. Hence U 
and W have complementary dimensions. Therefore the intersection ^ n W is non-empty and 
(iii) follows. □ 

By Proposition 12.11 for generic /i , • • • , fm the optimal solutions of (jl.ip can be character- 
ized by the KKT system (12. ID . and for generic objective function /o the KKT variety K is 
finite. Geometrically, the algebraic degree of the optimization problem (II. ip is, under these 
genericity assumption, equal to the number of distinct complex solutions of KKT, i.e., the 
cardinality of the variety K which we above showed to coincide with V fl W. The variety 
U nW above clearly contains K. On the other hand, ^ n W is finite and does not intersect 
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the hyperplane {xq = 0} when polynomials fi are generic. Since U — V = U (1 {xq = 0} and 
the ZYn Wn {xo = 0} = 0, we can see that the cardinality of K coincides with the cardinality, 
i.e., the degree ofU (IW. 

For integers (ni,n2, ■ ■ ■ ,nk), define the symmetric sum of products as follows 

(2.4) Dr{ni,n2, • • • , nfc) = ^ ■ ■ ■ . 

«l+«2H l-jfe=r 

Theorem 2.2. Assume m = rUe- If the polynomials /o,/i,'"" ^ fm generic, then the 
algebraic degree of is 

did2 ■ • • dmDn~m{do — 1, di — 1, • • • , dm — 1). 

Furthermore, if some fi is not generic and the system \2. 1\) is zero- dimensional, then the 
above formula is an upper bound of the algebraic degree. 

Proof. When /i, /2, ■ • • , /m are generic, lA is a. smooth complete intersection of codimension 
m. Its degree degiU) = did2 ■ ■ ■ dm- When /o is also generic, W has codimension n — m and 
intersects U in a finite set of points as shown above. If the intersection U DW is transverse 
(i.e., smooth) and hence consists of a collection of simple points, then the degree deg (UnW) 
counts the number of intersection points ofU D W, and hence the cardinality of KKT variety 
K, which is also the number of solutions to the KKT system (j2.ip for problem (jl.ip . 

To show that this intersection is transversal, we consider the subvariety X in P" x P*" 
defined by the m equations /i = /2 = ••• = /m = and the n equations 

M ■{Xo,...,Xmf = 0, 

where the Aj are homogeneous coordinate functions in the second factor. The image under the 
projection of the variety X defined by these m + n polynomials into the first factor coincides 
with the finite set U D W. Since M has rank at least m at every point of U, there is a unique 
A = (Ao, Am) G P"^ for each point x & U nW such that {x, A) lies in X. Therefore the X 
is a complete intersection. It is easy to check that this complete intersection does not have 
any fixed point when the coefficients of /o varies. So Bertini's Theorem [U §17.16] applies to 
conclude that for generic /o this complete intersection is transversal, which implies that the 
intersection Z// H W in P" is also transversal. 

Since the intersection U (IW is finite, i.e., has codimension in P*^ equal to the sum of the 
codimensions of U and W, Bezout's Theorem (cf. [H §8.4], [U Theorem 18.3]) applies to 
compute the degree 

deg{U nW)= deg{U) ■ deg{W). 

To complete the computation, we therefore need to find deg(yV). Since the codimension of 
W equals the codimension of the variety defined by the (m + 1) x (m + 1) minors of a general 
n X (m + 1) matrix with polynomial entries, the formula of Thom-Porteous-Giambelli [3l 
§14.4] applies to compute this degree: The degree of W equals the degree of the determinantal 
variety of n x (m + 1) matrices of rank at most m, in the space of matrices whose entries 
in the i-th column are generic forms of degree di — 1. These matrices may be considered as 
a collection of linear maps parameterized by P". More precisely, they define a map between 
vector bundles of rank m + 1 and n + 1 over P" 

M : Opn(-do + 1) © Opn{-dl + 1) © • • • © Opn(-dm + 1) > 0'^ + \ 

and W C P" is the variety of points over which the map has rank m. The Thom-Porteous- 
Giambelli formula computes the degree in terms of the topological Chern classes of these 
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vector bundles: The degree equals the degree of the Chern class 

cn-m((c;+^) - (Op- (-4 + 1) e Opn(-di + 1) e • • • Opn{-d^ + 1))) 

which coincides with the coefficient of t"""^ in 

1 _ 
(1 - (do - l)t) ■ ... ■ (1 - (d„ - l)t) ~ 
(1 + (do - l)t + (do - ift^ + ■■■)■■■{! + {dm. -l)t + {d^- ift^ + •••)• 

Thus deg(W) is the complete homogeneous symmetric function of degree codim(>V) evaluated 
at the column degree of G, which is Dn-mido — 1, di — 1, ■ • • , dm — !)• Therefore the degree 
formula for the critical locus U nW and hence the algebraic degree of (jl.ip is proved. 

When some polynomial fi is not generic, then a perturbation argument can be applied. Let 
X* be one fixed optimal solution of optimization problem (11. Apply a generic perturbation 
Ae/j to each fi so that {fi + A^fi)(x) is a generic polynomial and the coefficients of A^/j tends 
to zero as e ^ 0. Then one optimal solution x*{e) of the perturbed optimization problem 
p.ip tends to x*. By genericity of {fi + A^fi){x), we know 

ao(e)(x*(e))^ + ai{e){x*{e))'~' + ■■■ + as^i{e)x*i{e) + as{e) = 0. 

Here 6 = did2 ■ • • dmDn-m{do — 1, di — 1, • • • , dm — 1) and aj(e) are rational functions of the 
coefficients of fi and A^/j. Without loss of generality, we can normalize aj{e) such that 

max |aj(e)| = 1. 

0<j<o 

When e ^ 0, by continuity, we can see that x* is a root of some univariate polynomial whose 
degree is at most 5 and coefficients are rational functions of the coefficients of polynomials 
fo, fi, - ■ ■ , fm- □ 

Remark 2.3. The genericity assumption in the theorem is used to conclude that the critical 
locus Z//nyV is a smooth 0-dimensional variety by appealing to Bertini's Theorem [H §17.16]. 
A sufficient condition for Bertini's Theorem to apply can be expressed in terms of the sets 
Ui of polynomials in which the polynomials /o, /i, . . . , /m can be freely chosen. First assume 
that the generic polynomial in each Ui is reduced, and that Ui intersects every Zariski open 
set of a complex affine space Vi. Secondly, assume that the set of common zeros of all the 
polynomials in U^q^ empty. Then Bertini's Theorem applies. In fact, the polynomials fi 
for which the conclusion of Bertini's Theorem fails are contained in a complex subvariety of 
Vi. 

If some of the polynomials fi are reducible, then we may replace fi by the factor of least 
degree that contains the optimizer. The original problem (jl.ip . is then modified to one with 
a smaller algebraic degree. This is relevant in the above context, if the generic polynomial in 
Ui is reducible. 

Example 2.4. Consider the following special case of problem (II. ip 

fo{x) = 21x1 - Q2xixl - 70x1x3 - 95xj - Alxix^ + 51x1x1 + 47x1 + ^^i^t + SS^, 

fi{x) ^ 88xi + 64x1X2 - 22xiX3 - 37x1 + 682:1X2X3 - 84x2 + 80X3X3 + 28x3X3 - 20x2X3 - 7X3, 

f2{x) = 31 — 45xiX2 + 24xiX3 — 75x3 + 16x^ — 44x^X3 — 70xiX2 — 28x1X2X3 — 67x3X3 — 97x2X3. 

Here m = rrie = 2. By Theorem 12. 2( the algebraic degree of the optimal solution is bounded 
by 

4 • 3 • -Di(4, 3, 2) = 12 • (4 + 3 + 2) = 108. 
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Symbolic computation shows the optimal coordinate xi is a root of the univariate polynomial 
of degree 108 (whose coefficients are modulo 17) 

7-1™ -I- St^'"' + Yr^'"' 4- 4-r^°-' - T-^" - T^"^ -I- - 7-r^°° - 7r^^ + 7t°** -I- -St^^ - - fiT^=* + - St^^ 4- Rt™ 

X ^ OX ^ ^ ( Jj ^ '±X^ — Jj — Jj ^ ^ ^Ju ^ — I JL ^ — (J. ^ ^ ( Ju ^ OX^ — 'IXj^ — UX^ ^ ^JU ^ — OX ^ UX ^ 

, . 89 , c 88 , o 87 1 e 86 ^85 ^ 84 , „ 83 c 82 „ 81 , o 80 . 79 78 o 77 , 76 ^ 75 , ^ 74 

+ 4a;^ + bx^ + Zx-^ + bx-^ — 7a:^ — ox^ + oa;^ — bx-^ — 6x^ + oa:^ — 4x;^ ^2;^ — zx;^ + x^ — -ia;^ + dXj^ 
-\- 7x^Y + 4x{'^ + Zx^-^ ~ 4x]^" — 8x^^ — x\' — x^^ + 1x^^ H- 6x^^ — 4a;^'^ + hx^-^ + 2x^^ + 43;^*^ — 2a:^^ — 5x^^ + Ix^-^ 

„ 56 . . 55 . „ 54 o 53 „ 52 „ 51 .50 „ 49 . - 48 „ 46 . „ 45 „ 44 . _ 43 . . 42 - 41 40 

— oxj^ + oa^i + ox^ — 8a:i — Ix-^ — Ax^ — \x^ — ix^ + oxj^ — bx-^ H- bxj^ — bx-^ + ox^ + 5a:i — ox^ — x^ 

. c 39 .38 Q 37 , . 36 „ 35 34 c 33 „ 32 , „ 31 . „ 30 . „ 29 . . 28 n 27 _ 26 / 25 . „ 24 

+ ox^ — 4x^ — Jx^ + ox^ — 2x^ ~ ~ ~ ' ' ox^ + oXj^ + 4Xj^ — 8x^ — ox^ — 4x^ + 2x-^ 

— x^"^ + 2x^^ + 3x^^ + 2x'^'^ + 4x}^ + 6x}^ + bx]^ — 7x\^ — 2x}'^ — x\^ — 7x}^ + 5x}^ + 2x}^ — Sx]^ — 5x^ — 5x^ 

— 3x^ — 2x^ — 7x^ — 2x^ — Gx'^ — 3xi — 1. 

In this case the degree bound 108 is sharp. 



Now we consider the more general case that m > me, i.e., there are inequality constraints. 
Then a similar degree formula as in Theorem 12.21 can be obtained, when the active set is 
identified. 

Corollary 2.5. Let x* he one optimizer and ji,--- ,jk be the active set of inequality con- 
straints. If every active fi is generic, then the algebraic degree of x* is 

dl - ■ ■ dmedji ■ ■ ■ djj,Dn-me-k{do — 1, dl — 1, • • • , drrie — Ij djj — 1, • • • ,djf, — 1). 

If some fi is not generic and the system 12. 1\) is zero- dimensional, then the above formula is 
an upper bound of the degree. 

Proof. Note that x* is also an optimal solution of polynomial optimization problem 

min fn(x) 

s.t. fi{x) = 0, z = 1, • • • ,me 
fi{x) = 0, z = ji, • • • , jfe 

Hence the conclusion follows from Theorem 12.21 □ 



3. Some special cases 

In this section we derive the algebraic degree of some special polynomial optimization 
problems. The simplest special case is that all the polynomials fi in (jl.l|) have degree one, 
i.e., (II. ip becomes one linear programming of the form (II. 2p . If the objective c is generic, 
precisely n constraints will be active. So the algebraic degree is Do(0, 0, • • • ,0) = 1. This is 
consistent with we have observed in Introduction. Now let us look at other special cases. 

3.1. Unconstrained optimization 

We consider the special case that the problem (jl.ip has no constraints. It becomes an 
unconstrained optimization. Its optimal solutions makes the gradient of the objective vanish. 
By Theorem 12. 21 the algebraic degree is bounded by Dn{do — 1) = (do — 1)", which is exactly 
the Bezout's number of the gradient polynomial system 

V/o(x) = 0. 

Since /o can be chosen freely among all polynomials of degree do, Remark 12.31 applies to show 
that the degree bound above is sharp. 
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Example 3.1. Consider the minimization of foix) given by 

fo~Xi+X2+ x| + + Xi + X2 + x"^ + — 13xi — 'iQx\X2 — Oxixa + hx\x^ + \\x\ 

— 8X3X2 — 8X3 — 20X3X4 — 13X2X4 — 9x4 + Xi — 2X2 + 12x3 — 13X4. 

For the above polynomial, the algebraic degree of the optimal solution is 3^ = 81. Symbolic 
computation shows the optimal coordinate x\ of x* is a root of the univariate polynomial of 
degree 81 (whose coefficients are modulo 17) 

x^-^ — + 6x^^ — 2;}'^ + 22;}' — 2a:{^ + 5a:{^ — 1x^^ + ^x^"^ + 8x|^ + GxJ*^' — x^-^ + 2x^^ — 'bx\' + 1x^^ — A,x^-^ — Zx^-^ 
+ 2a:^'^ + Sx^"^ + 7x^-^ + 5a: "j**^' + 4x^"' + Ix^^-^ — 2x\' — Sx^^ — 2x\^ — Sx^^^ + 2x^^^ — 82;^^ — x^^ + 8x^'^ — 4a:^'^ — 7x'^^ — x^ 
+ 5xf + 6xf - 3xf + 5xf - 4xf - 5xf + xf - Axf - 3xf + 8xf + ixf + 2xf - 32;f - 7xf - Axf - 5xf + Axf 

,29 c 28 o 27 - 26 „ 25 I ^ 24 , _ 23 , „ 22 I _ 21 I 20 .19 ^ 18 , . 17 , _ 16 , _ 15 

— 4a:^ — bx-^ — fix-^ — bx^ — fix-^ + bx-^ + Ix-^ + 2x^ + ox^ + ^l ~ Ax^ — bx^ + Ax^ + 7x^ + ox^ 
+ 7xY^ — Ztx^^ H- 8a:}^ — x^' — ^x\ — Ax\ — bx^^ + 7x\ + 2x\ + 'ix\ + lx\ — 3a:^ + Sxi — 6. 

The degree bound 81 is sharp for this problem. 



3.2. Quadratic constrained quadratic programming 

Consider the special case that all the polynomials /o,/i,'"" ; /m are quadratic. Then 
problem (II. ip becomes one quadratic constrained quadratic programming (QCQP) which 
has the standard form 

min xF Aqx + h^x + cq 



s.t. x'^AiX + bjx + Cj > 0, i = 1, • • • ,i. 

Here Ai,bi,Ci are matrices or vectors of appropriate dimensions. The objective and all the 
constraints are all quadratic polynomials. At one optimal solution, suppose m < i constraints 
are active. By Corollary 12.51 the algebraic degree is bounded by 



(3.1) 2-.D,_^(l, I,... ,1) = 2-. Yl 1 




io+h+i2-i Vim=n-m 



The polynomials /o, /i, fm can be chosen freely in the space of quadratic polynomials, so 
Remark 12.31 applies to show that the degree bound above is sharp. 

Example 3.2. Consider the polynomials 

/o — —20 — 27xi + 89xi.X2 + 8OX1X3 — 45xiX4 + 19xiX5 + 42xi — 13x2 + 31x2X3 — 79x2X4 

+ 74x2x5 — 9x2 + 56.X3 — 77x3x4 — 2x3x5 + 35x3 + 40x4 — 13x4x5 + 60x4 + 58xg — 84x5, 
/i = 33 + bbx\ — 41xiX2 + 33xiX3 — 6IX1X4 + 96x1X5 + 12xi + 74.X2 — 90x2X3 — 57x2X4 

— 52x2X5 + 5I.X2 + 15x3 + 8IX3X4 + 87X3.X5 + 75x3 — IO.X4 + 58x4X5 + 33x4 + 83.X5 — 23x5, 
/2 = 8 — 9x^ + 56xiX2 — 24xiX3 + 8IX1X4 + 86x1X5 — 99xi — 77x2 — 75x2X3 + .X2X4 + 38x2X5 

+ 23x2 — 97.X3 — 14x3X4 — 73x3X5 + 65x3 + 3x4 — 14x4X5 + 16x4 + 9x5 — IO.X5, 
/3 = 9 + 90xi — 94xiX2 — 22x1X3 — 24xiX4 + 78xi + 32x2 — 48x2X3 — 6x2X4 + 8OX2X5 — 18x2 — 63x| 

+ 66x3x4 — 13X3.X5 + 88x3 — 45x4 — 92x4x5 — 69x4 — 43x5 + 32x5. 

For the above polynomials, the QCQP problem is nonconvex. We consi der those local optimal 
solutions which make all the three inequalities active. By Corollary l2.5t the algebraic degree of 
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this problem is bounded by 2™'(^) = 80. Symbolic computation shows the optimal coordinate 
xi is a root of the univariate polynomial of degree 80 (whose coefficients are modulo 17) 

80 o ~9 I ~8 I n 77 I ^ 76 ^ 75 , . 74 73 , 72 , - 71 . 69 , . 68 67 , 65 , - 64 , 63 ^ G2 ^ 61 

— dx^ + bx^ + 2x^ + bx^ — 6x^ + Ax^ — bx^ + x^ + 7x^ — 4x^ + 4x^ — bx^ + x^ + ox^ + x^ — 2x^ — bx^ 
+ 8x1" + 7r.f + a)f - 7:^f + 8x'f - 5:cf - - 3:rf + :rf - 5a.f - 4.;^' + 3:rf - 2a.f - - 7xf + 2xf + 8a;f 

. „ 43 , 42 , _ 41 „ 40 . _ 39 . „ 38 . „ 37 . _ 36 . 35 , . 34 , . 33 . 32 31 . „ 30 29 . . 28 

+ bx^ — Ax-^ A- bx-^ — ix-^ + ox-^ + Ix-^ + Ix-^ + ox-^ + x-^ + 4x-^ + 4x-^ -\- x-^ ~ ' ^^i ~ '^i ' 

„ 27 , c 26 , c 25 , - 24 , o 23 , r: 22 21 , „ 20 , o 19 18 , ^ 17 15 , 14 

— 2x^ + OX-^ + bx-^ -\- OX-^ + OX^ + OX-^ ^ ^1 ' ^^1 ' ^^1 ^ ^1 ' '^1 ~ ^1 + ^1 

+ 4x^^ + 7a:;^^ — Sx^^ + 3a:^ + 6a;^ — x[ -\- Sx^ — 4x'^ + 8a;^ + a:;!^ — 2a;^ + 7a:i — 4. 

The algebraic degree of this problem is 80 and the bound given by formula p.ip is sharp. 

3.3. Second order cone programming 

The second order cone programming (SOCP) has the following standard form 
min c^x 

(3.2) a:eR" 

s.t. afx + bi — \\CiX + di\\2 > 0, i = I, - ■ ■ , i 

where c,ai,bi,Ci,di are matrices or vectors of appropriate dimensions. Let x* be one opti- 
mizer. Since SOCP is a convex program, the x* must also be a global solution. By removing 
the square root in the constraint, SOCP becomes the polynomial optimization 

T 

mm c X 

s.t. {afx + bif - {dx + difidx + di) > 0, i = 1, • • • , ^. 

Without loss of generality, assume that the constraints with indices 1, 2, • • • , m are active at 
X* . The objective is linear but the constraints are all quadratic. As we can see, the Hessian 
of the constraints has the special form aiof — CjCi. Let be the number of rows of Ci. 
When Tj = 1, the constraint afx + bi — \\CiX + di\\2 > is equivalent to two linear constraints 

-{afx + bi) < CiX + di< afx + bi. 

Thus, when every = 1, the problem reduces to a linear programming and hence has 
algebraic degree one, because in this situation the polynomial {afx + bi)^ — {CiX + dj)^ is 
reducible. When rj > 2 and ai,bi,Ci,di are generic, the polynomial {afx + 6j)^ — {CiX + 
di)^ {CiX + di) is quadratic of rank rj + 1 and hence irreducible. Without loss of generality, 
assume 1 = ri = r2 = ... = < r^+i < ••• l^f-m- Then problem p.2p is reduced to 

T 

mm c X 

s.t. afx + bi + ai{CiX + dj) > 0, i = 1, • • • , A: 

{afx + bif - {dx + di)'^{CiX + dj) > 0, i = A; + 1, • • • , m 

where scalar ai is chosen such that afx* + bi + ai{CiX* + di) = 0. By Corollarv 12.51 the 
algebraic degree of SOCP in this modified form is bounded by 

(3.3) 2"-'=. A.-™(0,-- 

rn-k times 

When k = m, we have already seen the algebraic degree is one. 
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For the sharpness of degree bound (j3.3p . we apply Bertini's Theorem following Remark 12 .31 
For every i = k + 1, ■ ■ ■ , m, define the set Ui of polynomials as 



, Or. G 



, m. 



Ui = I {af X + bi)'^ - ^ OjiCiX + di)'j : ai,' 

[ l<j<ri 

Then define affine spaces Vi as follows: 

Vi = I {afx + bif - I3j{ax + di)] : . . . e C I , i = A: + 1, 

Then every set Ui intersects any Zariski open subset of the affine space Vi. On the other 
hand the set of common zeros of the linear polynomials 

afx + bi + ai{CiX + di), i = 1, k 

and all the polynomials in the union U™^^,_,_^ Vi is contained in the set 

k ^ C T u n~l 

(3.4) Z=f|{xGM": afx + 6, + a,(Qx + d,) = 0} f] \^ ^^"^ ■ Cxtd = o\ ' 
i=i i=k+i ^ iX+ I ) 

Therefore, for generic choices a,, ^j, Cj, di, if rfc_|_i + - • ■+rm+m > n, the set Z is empty. Hence 
Remark [2.31 applies to show that, for generic choices of c, 6i, Cj, di, if rfc_|_i + - • •+rm+Tn > n, 
the algebraic degree bound 2'^~^ ■ ("^~^~^) is sharp. 

Example 3.3. Consider SOCP defined by polynomials 

/o = -xi + 6x2 + I3.T3 + lla;4 + 8x5, 

/i = (-11x1 - 18x2 " 4x3 + 2x4 - 12x5 + 7)^ - (-4xi - 10x2 + 20x3 - 4x4 - 8x5 + 3)^ 

- (-5x1 - 11x2 + 8x3 - 18x4 + 112:5 + 15)^ - (21x1 + 18x2 - 12x3 - IOX4 - 8x5 + 4)^, 
/2 = (-5x1 - 5x2 - 7x3 - 6x4 + 4x5 + 41)^ - (xi - 2x2 + IOX3 - 21x4 - H)^ 

- (-12x1 + 3x2 + 16x3 + 4x4 + 2:5 + 9)^ - (14x1 + 20x2 - 13x3 - 7x4 + 4x5 + 2)^, 
h = (a-'i - 8x2 + 11x3 -xz + 22f - (2x1 - X2 + 3x3 - X4 - 25x5 - 8)^ 

- (-2x1 - 17x3 + 14x4 + 4x5 - 7)^ - (xi + 12x2 + 14x3 - 6x4 - 4x5 - 10)^ 

There are no linear constraints. For this SOCP, all the three inequalities are active at the 
optimizer. All the matrices Ci has three rows. By formula (j3.3p . the algebraic degree of 
this problem is bounded by 2'^(3~J) = 48. Symbolic computation shows that the optimal 
coordinate xi is a root of the univariate polynomial of degree 48 (whose coefficients are 
modulo 17) 

_ o„47 _ 46 45 44 43 _ ^ 42 _ 41 40 _ . 39 _ - 38 „ 37 _ 36 35 34 _ 33 32 

— ^ — ox ^ ^ ox ^ ^ ^x ^ ^ ox^ — ux. ^ — ix ^ ^ ox ^ — ox ^ — ix^ ^ ix ^ — ox ^ zx. ^ zx. — (X^ ^ ux ^ 

, o 31 , o 29 I 28 c 27 26 , 25 , . 24 « 23 22 , - 21 ..^ 20 . 19 , ^ 18 o 17 , c 16 

+ ox^ + ox^ + x-^ — DX^ — ox^ + x^ + 4x^ — 7x^ — x^ + 5x^ + dx^ — 4x^ + 2x^ — 8x^ + 5x^ 
+ 8x}'' + 2x\'^ + 5x^"^ — 4x^^ + 7x\^ — 2x^^ — Ax\ + 4x^ + 4:x\ + 3x^ — 4x^ — 5x^ — Sx\ + x^ — xi — 1. 

The algebraic degree of this problem is 48, so the upper bound is sharp in this case. 

3.4. p-ih order cone programming 

The p-th order cone programming (pOCP) has the standard form 

mina;eMn c^x 

s.t. afx + bi — \\CiX + diWp > 0, i = 1, ■ ■ ■ ,£ 
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where c, ai,bi,Ci, di are matrices or vectors of appropriate dimensions. This is also a convex 
optimization problem. Let x* be one optimizer, and assume the constraints with indices 
1, • • • ,m are active at x* . Suppose the matrices Ci has rj rows. When some rj = 1, the 
constraint ajx + 6j — \\CiX + di\\p > is equivalent to two linear constraints 

— {ajx + hi) < CiX + di < afx + bi. 

Like the SOCP case, assume 1 = ri = • • • = < rk+i < • • • < r^- Then problem (|3.5p is 
equivalent to 

T 

mm c X 

s.t. afx + bi + ai{CiX + di) >f), i = 1, . . . ,k 

{afx + bif - ^{CiX + difj > 0, i = fc + 1, . . . , m 

where cTj is chosen such that afx* + bi + ai{CiX* + di) = 0. In this situation 

^ ^ ^ ^ — ^ 

. ifc + iH Hm=rL~m 

m~k times 

By Corollary 12.51 the algebraic degree of x* is therefore bounded by 

' n — k — 1 



m — k — I 



(3.6) 1)"-™ 



m — k — 1 



When k = m, problem ()3.5p is reducible to some linear programming and hence its algebraic 
degree is one. 

Now we discuss the sharpness of degree bound (13. 6p . Similarly to the SOCP case, for every 
i = k + 1, ■ ■ ■ ,m, define the set of polynomials Ui as 

Ui= I {afx + bi)P - Y ot]iCiX + dif^ : ai, • • • , a,.^ G : 

Then define affine spaces Vi as follows: 



Vi = \ {afx + bi)P - Y l^i^'^i^ + d^fl : A, • • • G C }> , i = A: + 1 

l<j<ri 



771. 



Then every set Ui intersects any Zariski open subset of the affine space Vi. On the other 
hand, the set of common zeros of the linear polynomials 

afx + bi + ai{CiX + di), i = 1, k 

and all the polynomials in the union IJS=fc+i ^ contained in the set Z defined by (j3.4p . 
Therefore, for generic choices of ai,bi,Ci,di, if r/j+i + ■ ■ ■ + + m > n, the set Z is empty, 
and hence Remark 12.31 implies that the degree bound given by formula (j3.6p is sharp. 

Example 3.4. Consider the case p = 4 and the polynomials 

/o = 9x1 — 5x2 + 3x3 + 2x4 

/l = (1 — 6xi — 6x2 + 4x3 — 9x4)* — (7 — 6x1 + 22x2 — X3 + X4)'' — (11 + xi — X2 — 8x3 + 3x4)"* 

- (-13 + 7x1 + 16x2 - 7x3 + 9x4)'* - (3 - llxi + 14x2 - 8x3 + 5x4)'' - (8 + 9xi - 10x2 + 2x3 + 2x4)''. 
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For the above polynomials, the inequality constraint must be active since the objective 
is linear. By formula (j3.6p . the algebraic degree of the optimal solution is bounded by 
p"^{p — = 108. Symbolic computation shows the optimal coordinate xi is a root 

of the univariate polynomial of degree 108 (whose coefficients are modulo 17) 

108 _ 107 _ 106 105 104 _ 103 _ ,102 _ 101 100 99 _ 98 . 97 _ 96 _ 95 

•ij — OX-|^ — OJ.-|^ i Jj OX-|^ — jLiJj-^ — ^Jj-^ — UX-|^ jLiJ.-^ ^ OJj-^ — OX -j^ O-L-^ — OX-j^ — OX-|^ 

, . 94 , n 93 , ^ 92 . 91 , e 90 I 89 , 88 87 c S6 ^ 85 , 84 , r- 83 81 , 80 , „ 79 

e 78 , _ 77 , ^ 76 „ 75 , . 74 c 73 ^ 72 , 70 , „ 69 68 , o 67 „ 66 , r- 65 , . 64 , 63 

- bx-^ + tx-^^ + 2,x-^^ ~ 6x^ 4- 4a;-|^ — ox-^^ — bx-^ + x-^ + 2x-^ ~ x-^^ + Sx-^^ — ox^ + ox-^ + 4a;-|^ + a;-,^ 
+ x^"^ — 2x1^ — x^^ + 3a;^^ — 7a;^^ — 7x^' + 7a;^^ — 3a;^'^ — 3a;';^'' — Sa:^'^ — 4x^^ — 4x^^ — 3x'^^ — 4a;^^ + a;^'^ 
+ Srcf + 4:rf - ixt" - 8^f - 8:tf - 7xf - Sxf' + 4xf - 5x1" - 7xf + ixf - 2a:f + + 6a:f + &xf 

- 7xf - ?.xf - 5xf + 7xf + 3i^f + exf + 1x\^ - Srcf - 8xf - Ax\^ + 8xf + Sx'f - aa.}** + ex\' - 5xf 

- SkJ^ + Sa" + 8^;^ + &x\^ - bx]° + 32-1 + 22-1 - 22:1 + &x\ + Ax\ + Txf - Sx^ + 4xi + 5. 

So the algebraic degree of this problem is 108, and the bound given by formula ()3.6p is sharp. 
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